Hierarchical Orderings of Textual Units
نویسنده
چکیده
Text representation is a central task for any approach to automatic learning from texts. It requires a format which allows to interrelate texts even if they do not share content words, but deal with similar topics. Furthermore, measuring text similarities raises the question of how to organize the resulting clusters. This paper presents cohesion trees (CT) as a data structure for the perspective, hierarchical organization of text corpora. CTs operate on alternative text representation models taking lexical organization, quantitative text characteristics, and text structure into account. It is shown that CTs realize text linkages which are lexically more homogeneous than those produced by minimal spanning trees.
منابع مشابه
Distinguishing between Coherent and Incoherent Texts
In this paper, I show that current discourse theories are not able to explain why different orderings of the same textual segments exhibit different properties with respect to coherence. I then propose a criterion of coherence that exploits both the strong tendency of textual units that are associated with certain rhetorical relations to obey a canonical ordering and the inclination of semantic...
متن کاملTextuality: The ‘form’ to Be Focused on in SLA
Due to the special (procedural) nature of the language (verbal communication) ‘knowledge’, the dominant trends in applied linguistics research in the last few decades have been advocating ‘acquisition’ rather than ‘learning’ activities where the main focus in SL & FL education should be on ‘meaning’ while some ‘focus-on-form’ being justified. But the ‘form’ to be ‘focused-on’ is mostly misconce...
متن کاملPreservation of Stochastic Orderings of Interdependent Series and Parallel Systems by Componentwise Switching to Exponentiated Models
This paper discusses the preservation of some stochastic orders between two interdependent series and parallel systems when the survival and distribution functions of all components switch to the exponentiated model. For the series systems, the likelihood ratio, hazard rate, usual, aging faster, aging intensity, convex transform, star, superadditive and dispersive orderings, and for the paralle...
متن کاملSome New Results on Stochastic Orderings between Generalized Order Statistics
In this paper we specify the conditions on the parameters of pairs of gOS’s under which the corresponding generalized order statistics are ordered according to usual stochastic ordering, hazard rate ordering, likelihood ratio ordering and dispersive ordering. We consider this problem in one-sample as well as two-sample problems. We show that some of the results obtained by Franco et al. ...
متن کاملA Study on Preference Orderings of Mathematical expectation, Expected Utility and Distorted Expectation
One of the challenges for decision-makers in insurance and finance is choosing the appropriate criteria for making decisions. Mathematical expectation, expected utility, and distorted expectation are the three most common measures in this area. In this article, we study these three criteria, and by providing some examples, we review and compare the decisions made by each measure.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002